DNN-Based Amplitude and Phase Feature Enhancement for Noise Robust Speaker Identification
نویسندگان
چکیده
The importance of the phase information of speech signal is gathering attention. Many researches indicate system combination of the amplitude and phase features is effective for improving speaker recognition performance under noisy environments. On the other hand, speech enhancement approach is taken usually to reduce the influence of noises. However, this approach only enhances the amplitude spectrum, therefor noisy phase spectrum is used for reconstructing the estimated signal. Recent years, DNN based feature enhancement is studied intensively for robust speech processing. This approach is expected to be effective also for phase-based feature. In this paper, we propose feature space enhancement of amplitude and phase features using deep neural network (DNN) for speaker identification. We used mel-frequency cepstral coefficients as an amplitude feature, and modified group delay cepstral coefficients as a phase feature. Simultaneous enhancement of amplitude and phase based feature was effective, and it achieved about 24% relative error reduction comparing with individual feature enhancement.
منابع مشابه
Adversarial Network Bottleneck Features for Noise Robust Speaker Verification
In this paper, we propose a noise robust bottleneck feature representation which is generated by an adversarial network (AN). The AN includes two cascade connected networks, an encoding network (EN) and a discriminative network (DN). Melfrequency cepstral coefficients (MFCCs) of clean and noisy speech are used as input to the EN and the output of the EN is used as the noise robust feature. The ...
متن کاملRobust Speaker Verification System with Anti-spoofing Detection and DNN Feature Enhancement Modules
This thesis focuses on the robustness issues of speaker verification (SV) systems. Although current SV systems perform well under clean condition, their performance degrades dramatically under real-world uncontrolled environments. The reliability of current SV systems is also questionable under spoofing attacks. These pitfalls severely limit it’s deployment in many applications. This thesis pre...
متن کاملMissing features detection and handling for robust speaker verification
This paper addresses the problem of robust textindependent speaker verification in the presence of missing (masked by noise) features. It presents and assesses several missing feature handling approaches. In these approaches, the speech enhancement and missing feature detection are based on the minimum mean-square error (MMSE) spectral amplitude estimator of Ephraim and Malah [1].
متن کاملSpeaker feature extraction from pitch information based on spectral subtraction for speaker identification
Robust speaker feature extraction under noise conditions is an important issue for application of a speaker recognition system. It is well known that LPC cepstrum, which expresses the spectral envelope, is e ective for speaker recognition. This implies that the spectral rough structure is e ective for speaker recognition. However, LPC cepstrum is a noise-sensitive feature. On the other hand, sp...
متن کاملDeep neural network-based bottleneck feature and denoising autoencoder-based dereverberation for distant-talking speaker identification
Deep neural network (DNN)-based approaches have been shown to be effective in many automatic speech recognition systems. However, few works have focused on DNNs for distant-talking speaker recognition. In this study, a bottleneck feature derived from a DNN and a cepstral domain denoising autoencoder (DAE)-based dereverberation are presented for distant-talking speaker identification, and a comb...
متن کامل